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Biological  protein  materials  feature  hierarchical  structures  that  make  up  a  diverse  range  of  physiological  materials.  The  analysis 
of  protein  materials  is  an  emerging  field  that  uses  the  relationships  between  biological  structures,  processes  and  properties  to 
probe  deformation  and  failure  phenomena  at  the  molecular  and  microscopic  level.  Here  we  discuss  how  advanced  experimental, 
computational  and  theoretical  methods  can  be  used  to  assess  structure-process-property  relations  and  to  monitor  and  predict 
mechanisms  associated  with  failure  of  protein  materials.  Case  studies  are  presented  to  examine  failure  phenomena  in  the 
progression  of  disease.  From  this  materials  science  perspective,  a  de  novo  basis  for  understanding  biological  processes  can 
be  used  to  develop  new  approaches  for  treating  medical  disorders.  We  highlight  opportunities  to  use  knowledge  gained  from 
the  integration  of  multiple  scales  with  physical,  biological  and  chemical  concepts  for  potential  applications  in  materials  design 


and  nanotechnology. 

Proteins  are  essential  building  blocks  of  life,  forming  a  diverse 
group  of  biological  materials  that  ranges  from  spider  silk 
to  bone,  and  from  tendons  to  the  skin,  all  of  which  play  an 
important  part  in  providing  key  biological  functions1-7.  These 
materials  are  distinct  from  the  conventional  categories  of  structure 
and  material,  as  they  represent  the  merger  of  these  two  concepts 
through  hierarchical  formation  of  structural  elements  that  range 
from  the  nanoscale  to  the  macroscale  (Fig.  la,  b). 

Protein  materials  are  abundant  in  biology  and  play  an  essential 
part  in  the  biological  function  of  all  cells  and  tissues  within 
organisms.  Many  such  materials  with  mechanical  function  form 
structural  filaments,  trusses  or  fibres,  whereas  others  retain  the 
globular  structure  of  their  protein  constituents.  Biological  protein 
materials  are  commonly  characterized  by  where  they  reside  with 
respect  to  their  associated  tissue,  to  serve  as  either  extracellular 
or  intracellular  protein  materials1.  Intracellular  protein  materials 
provide  the  architecture  of  a  cell  and  include  vimentin,  microtubules, 
actin  (proteins  from  a  cells  cytoskeleton)  and  lamins  (forming  the 
cells  nuclear  envelope)1-3.  Extracellular  protein  materials,  secreted 
by  cells  into  the  surrounding  microenvironment,  include  elastin 
and  collagen1’4,5.  Other  protein  materials  such  as  fibrin6-9  appear  in 
biological  processes  such  as  the  clotting  of  blood.  Distinct  protein 
materials  in  organisms  cannot  be  considered  in  isolation.  For 
instance,  lamins  are  connected  to  extracellular  protein  materials 
(such  as  collagen)  through  cytoskeletal  proteins  (for  example  actin 
and  nesprin). 

Here  we  focus  on  defining  structural  components  of  biological 
protein  materials  from  a  materials  science  perspective,  and  on  their 
role  in  mechanical  materials  phenomena,  specifically  materials 
failure,  in  diseased  or  altered  physiological  conditions.  Advances  in 
experimental,  theoretical  and  computational  materials  science  have 
led  to  a  deeper  understanding  through  the  linking  of  structure  to 
process  and  property  (Fig.  lc).  An  overview  of  failure  in  biological 
systems,  broadly  defined  as  the  loss  of  its  ability  to  provide  an 
intended  physiological  function,  will  be  specifically  presented  in  the 
context  of  material  breakdown  due  to  a  range  of  causes,  including 
altered  chemical  or  physical  boundary  conditions  (such  as  extreme 
forces),  weakening  of  tissues  due  to  structural  flaws  because  of 


genetic  defects,  or  the  inability  of  a  tissue  to  provide  its  function 
because  of  the  interaction  with  an  ectopic  or  foreign  material. 

Within  the  biological  sciences,  the  field  of  genomics  has 
advanced  our  knowledge  base  through  the  successful  sequencing  of 
entire  genomes.  Extensive  efforts  have  been  made  to  move  beyond 
genomics,  where  fields  such  as  systems  biology  provide  explanations 
of  mechanisms  of  how  genes  affect  phenotypes  and  biological 
function10,11.  More  recently,  a  movement  to  understand  the  materials 
science  of  proteins,  aimed  at  developing  relationships  between 
structure  (of  biological  protein  materials),  process  (for  example  self 
assembly)  and  property  (for  example  strength  or  elasticity),  shows 
great  promise  in  contributing  to  a  deeper  understanding  of  biological 
systems.  This  study  of  material  properties  of  hierarchical  protein 
structures  and  their  effect  on  molecular  and  microscopic  properties, 
by  using  mechanistic  insight  based  on  structure-process-property 
relations  in  the  biological  context  (Fig.  Id),  provides  a  basis  for 
understanding  disease  processes.  This  could  help  in  developing  new 
approaches  to  treating  genetic  and  infectious  diseases,  injury  and 
trauma,  as  well  as  in  improving  engineered  materials  by  translating 
material  concepts  from  biology.  Associated  materials  science 
challenges  and  opportunities  are  summarized  in  Table  1. 

Materials  in  biology  often  involve  non-protein  components 
such  as  crystal  platelets  (in  bone,  dentin  and  nacre,  for  example), 
which  have  important  effects  on  their  mechanical  properties. 
They  also  contain  larger-scale  structural  features,  which  influence 
the  overall  mechanical  properties  (for  example  porosities  and 
interaction  with  fluids).  Although  proteins  are  an  important 
element  that  affects  most  materials  in  biology,  they  alone  do  not 
provide  the  information  necessary  to  characterize  all  relevant 
material  properties.  The  discussion  of  these  effects  is  beyond  the 
scope  of  this  review. 

The  cascaded  arrangements  of  building  blocks  at  defined 
length  scales  form  hierarchical  structures  (for  example  molecules, 
filaments,  mesoscale  structures)  which  control  a  materials 
properties.  Figure  la,  b  shows  the  structure  of  two  example 
materials,  showing  lamin  intermediate  filaments  (in  the  following 
referred  to  as  ‘lamins’) 12-15  and  collagen  (for  example  tendon  and 
cornea)4,5,  and  illustrates  how  structure  and  material  converge. 
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Figure  1 1  Examples  of  hierarchical  multiscale  structures  of  biological  protein  materials.  Here  we  show  lamin  intermediate  filaments  (lamins), 
collagenous  tissues  (tendon)  and  the  materials  science  paradigm  for  the  analysis  of  biological  protein  materials,  a,  Structure  of  lamins.  a-Helical  protein 
domains  assemble  into  dimers,  which  form  filaments  that  define  a  lattice-like  lamin  network  of  the  cell's  nucleus.  Lamina  mesh  network  snapshot  adapted 
and  reprinted  with  permission  from  ref.  16.  ©  1986  NPG.  Cell  nucleus  image  ©TenOfAIITrades.  b,  Structure  of  collagenous  tissues,  from  nanoscale 
to  macroscale.  Tropocollagen  (TC)  molecules  assemble  to  form  fibrils,  which  form  fibres  that  provide  the  structural  basis  of  collagen  tissues  such  as 
tendon.  Collagen  fibril  image  reprinted  with  permission  from  ref.  76.  ©  2002  Elsevier.  Image  of  Achilles  tendon  from  SPL.  c,  Materials  science  triangle 
that  links  structure,  process  and  property,  d,  Materials  science  paradigm  applied  to  the  hierarchical  structure  of  protein  materials  (H,  refers  to  hierarchy 
levels  /  =  0  to  N,  R,  refers  to  material  property  requirements  at  hierarchy  levels  /  =  0  to  N).  The  cycle  initiates  at  H0  (process  H0,  the  only  level  at  which 
protein  expression  occurs)  to  form  protein  constituents  (structure  H0).  Their  properties  (property  H0)  control  the  association  at  the  next  hierarchical  level 
(process  leading  to  structure  H^.  This  cycle  continues  through  all  hierarchical  levels  /  =  0  to  N,  where  process  and  structure  H,  and  beyond  denote 
protein  assembly  stages.  At  each  stage,  the  properties  of  the  structure  control  the  assembly  at  the  next  level.  Overall,  properties  at  different  hierarchical 
levels  (properties  H,)  are  regulated  by  corresponding  physiological  demands  (requirements  R(),  which  are  sensed  and  transduced  intracellularly  to  activate 
genetic  regulation,  resulting  in  changes  to  process  H0. 


Lamins  provide  structural  support  to  the  cells  nucleus  and 
form  an  interface  between  the  cells  cytoskeleton  and  chromatin. 
Figure  la  shows  the  structure  of  lamins12-15,  characterized  by  a 
hierarchical  assembly  of  a-helix-based  protein  domains.  The  basic 
building  block  of  lamins  is  an  a-helical  coiled-coil  dimeric  molecule. 
Lamin  dimers  form  filaments  that  develop  into  a  network  with  a 
lattice-like  structure,  found  primarily  at  a  cells  nuclear  membrane16. 
Because  the  cells  cytoskeleton  itself  is  coupled  to  the  surrounding 
extracellular  matrix  (for  example  collagen)  by  adhesion  proteins, 
macroscopic  deformation  of  tissues  typically  leads  to  deformation 
of  the  cell  nucleus17-20.  The  lamin  network  thus  aids  the  coupling 
of  tissue-level  mechanical  signals  to  complex  biochemical  processes 
(such  as  gene  regulation)  within  the  cell  nucleus16’17,21,22.  The  study 


of  strain- regulated  mechanisms  of  cell  response  falls  broadly  into 
the  field  of  mechanotransduction3,23.  Lamins  represent  a  model 
material  that  illustrates  the  intimate  connection  between  structural 
material  and  biochemical  properties.  Recent  studies  have  provided 
substantial  evidence  that  underlines  the  role  of  lamins  in  cancer  and 
genetic  diseases17,21,22,24. 

Collagen,  the  most  abundant  protein  on  the  Earth,  is  a  fibrous 
structural  protein  with  superior  mechanical  properties.  It  is  found  in 
a  number  of  tissues  exposed  to  severe  tensile  or  compressive  loading, 
including  tendon,  bone,  teeth,  cartilage  and  the  cornea4,25-29.  The 
hierarchical  structure  of  collagen  is  summarized  in  Fig.  lb.  Collagen 
consists  of  triple  helical  tropocollagen  molecules  that  appear  as 
nano-ropes’  with  lengths  of  around  300  nm  (ref.  30).  Staggered 
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Table  1 1  Key  materials  science  challenges  associated  with  the  study  of  biological  protein  materials,  examples  and  opportunities  for 
further  development  of  methods. 


Materials  science  challenge  Example 


Significance 


Need  for  model  development 


Confinement  effects 


Environmental  effects 


Material  failure 


Multiscale  synthesis  or 
fabrication 


Multiscale  characterization 


Change  of  material  properties  at 
different  length  scales  and  timescales 
(for  example  size  effects,  pulling  rate 
effects) 

Effects  of  chemicals  and  solvent  (for 
example  pH)  on  formation,  function  and 
material  breakdown 

Material  degradation  in  genetic  and 
infectious  diseases,  failure  due  to  impact 
and  stress  in  injuries  or  trauma 


Structure  control,  from  nanoscale 
to  macroscale;  how  to  control 
implementation  of  hierarchies 


Structure  measurement  at  various 
timescales  and  length  scales 
(for  example  assembly  stages), 
characterization  of  mechanisms  of 
deformation 


Adaptation  of  universal  or  diverse 
protein  structures,  biomaterials 


Tissue  remodelling,  protein 
filament  adaptation/remodelling, 
mechanotransduction 
(chemomechanical  signalling) 

Mechanisms  of  genetic  diseases  (for 
example  osteogenesis  imperfecta, 
progeria,  muscle  dystrophies, 
Alzheimer's  disease),  nanomedicine 


Synthesis  of  synthetic  biofibres  (such 
as  spider  silk)  and  bioinspired  or 
biomimetic  materials  (scaffolding 
protein  materials  for  tissue  engineering, 
interfaces  between  microdevices  and 
biological  systems) 


Constitutive  material  models  with 
explicit  consideration  of  hierarchies 


Chemomechanical  models  of  tissues, 
atomistic-level  descriptions  that 
integrate  mechanics  and  chemistry 

Constitutive  strength  models  with 
explicit  consideration  of  hierarchies, 
plasticity/failure  models  (for  example 
rupture  of  hydrogen  bonds),  disease 
models  (such  as  role  of  molecular 
defects  and  misfolds) 

Integration  of  universal  and  diverse 
elements  in  hierarchical  structures, 
translation  from  biology  to 
nanotechnology  (carbon  nanotubes, 
nanowires,  peptides) 


Application  of  atomic  force  microscopy,  Statistical  analysis,  high-throughput 

optical/magnetic  tweezers,  microfluidic  methods,  error/uncertainty  analysis 

devices,  transmission  electron 

microscopy  for  material  characterization, 

integration  of  measuring  and  imaging 

approaches  across  multiple  scales  to  aid 

model  development 


arrays  of  tropocollagen  molecules  form  fibrils,  which  combine  to 
form  collagen  fibres  and  then  to  form  tissues  such  as  tendon,  which 
functions  as  glue  between  muscle  and  bone.  Tropocollagen  molecules 
show  extreme  extensibility,  withstanding  up  to  50%  tensile  strain 
before  catastrophic  failure.  This,  along  with  their  ability  to  sustain 
stresses  of  several  gigapascals,  is  crucial  for  their  physiological 
mechanical  role  in  tissues31.  The  staggered  arrangement  of  molecules 
into  fibrils  provides  the  basis  for  its  ability  to  dissipate  mechanical 
energy  through  molecular  sliding  rather  than  leading  to  catastrophic 
failure32-34.  This  staggered  architecture  plays  a  key  part  in  increasing 
the  toughness  of  various  collagen  materials  such  as  tendon  or  bone. 
The  overall  hierarchical  arrangement  is  important  because,  through 
structures  formed  at  characteristic  geometric  length  scales,  it  enables 
superior  molecular  properties  to  be  visible  at  larger,  biologically 
important,  intermediate  mesoscales.  A  possible  concept  to  explain 
the  observed  length  scales  at  each  hierarchical  level  is  that  they  are 
a  result  of  structural  adaptation  towards  maximizing  target  material 
properties  (such  as  strength  and  dissipation)  through  geometric 
size  effects,  as  discussed  in  the  context  of  bone35.  Several  genetic 
diseases  are  related  to  defects  in  the  collagen  structure,  leading  to 
mechanically  compromised  tissues  (an  example  is  the  brittle  bone 
disease  osteogenesis  imperfecta). 

The  biological  materials  introduced  above  feature  properties 
that  are  characteristic  for  this  entire  class  of  materials,  including 
robustness,  adaptability  and  multifunctionality411.  Robustness  is 
defined  as  the  degree  of  separation  between  stability  and  failure, 
and  is  crucial  to  understanding  the  role  of  protein  materials  in 
biology,  for  both  physiological  and  pathological  conditions  (such 
as  varying  pH,  forces  or  structural  changes).  Adaptability  refers 
to  the  ability  of  a  material  to  cope  with  environmental  changes  by 
changing  its  structural  arrangement  to  cope  better  with  changed 
conditions.  For  example,  tissue  remodelling  in  bone  plays  a  crucial 
part  in  improving  the  materials  damage  tolerance  through  its 
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intrinsic  ability  to  repair  itself36.  The  underlying  mechanism  is  that 
small  cracks,  formed  because  of  physiological  mechanical  load,  are 
detected  by  Tone  remodelling  units’  and  removed  from  the  tissue 
before  they  reach  a  critical  size  at  which  catastrophic  failure  would 
occur.  Such  mechanisms  show  the  intricate  connections  between 
tissue  formation  and  overall  failure  properties.  In  vivo ,  the  structures 
of  protein  materials  self-maintain  or  adapt  by  means  of  feedback 
loops  by  translating  spontaneous  demands  in  the  microenvironment 
(through  intracellular  signalling)  to  regulate  gene  activation  or 
deactivation.  Ultimately,  this  alters  the  material’s  structural  makeup 
to  better  suit  the  local  physiological  needs  (Fig.  Id).  Multifunctionality 
refers  to  the  material’s  ability  to  provide  multiple  functions  such  as 
mechanical  strength  and  the  ability  to  control  biochemical  processes 
(as  observed  in  lamins  that  have  structural  as  well  as  biochemical 
roles).  Many  protein  materials  show  graceful  degradation,  reflecting 
the  material’s  ability  to  induce  a  controlled  breakdown  of  a  system’s 
function  after  damage  without  leading  to  catastrophic  failure  and 
without  affecting  materials  in  the  environment.  The  hierarchical 
structure  maybe  the  basis  for  these  unique  properties,  distinguishing 
them  from  many  engineered  materials. 

The  formation  of  protein  materials  in  biological  systems  occurs 
through  self  assembly  of  protein  molecules  whose  structure  is 
encoded  in  an  organism’s  DNA.  Protein  molecules  are  composed 
of  long  polymer  chains,  constructed  of  amino  acid  monomers, 
and  synthesized  by  a  translation  process  that  converts  the  genetic 
information  transcribed  from  DNA  into  RNA  into  a  polypeptide 
sequence  of  amino  acids.  Protein  synthesis  is  initiated  at  precise 
sites  during  mRNA  translation  and  terminated  when  signalled  by 
the  presence  of  encoded  stop  codons  (messages).  Once  a  protein 
molecule  is  constituted,  the  polypeptide  chain  folds  into  its  unique 
three-dimensional  (3D)  conformation  with  the  aid  of  chaperone 
proteins1.  Functional  activation  of  the  protein  structure  often 
involves  binding  with  other  molecules  to  form  the  protein’s  intended 
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Figure  2  |  Formation  and  assembly  of  biological  protein  materials,  including  cellular  expression  of  proteins  and  assembly  into  larger-scale  hierarchical 
structures,  a,  Generation  of  protein  molecules  through  translation  from  the  DNA  genetic  code,  leading  to  the  expression  of  individual  3D  protein 
molecules,  b,  Schematic  biological  assembly  of  lamins14  and  experimental  visualization.  The  self  assembly  of  protein  molecules  into  large-scale  structures 
proceeds  through  a  complex  process  that  involves  several  intermediate  structural  steps.  For  lamins,  assembly  occurs  first  by  association  of  two  dimers 
into  tetramers,  then  by  a  period  of  growth  in  the  axial  direction  (I),  followed  by  growth  in  the  radial  direction  (II  and  III).  Experimental  visualizations  of 
the  assembly  process  of  lamins  are  obtained  from  electron  microscopy  studies  (rat  liver  lamin  A/C,  scale  bar  100  nm).  Adapted  and  reprinted  with 
permission  from  ref.  14.  ©  2004  Annual  Reviews,  c,  Experimental  visualization  of  the  assembly  of  tropocollagen  into  collagen  microfibrils  based  on 
contact-mode  atomic  force  microscopy  height  images.  The  characteristic  regular-spaced  banding  structure  of  collagen  microfibrils  is  visible  in  the  marked 
region  where  multiple  tropocollagen  molecules  begin  to  interact  (black,  0  nm  height,  white,  35  nm  height).  Adapted  and  reprinted  with  permission  from 
ref.  76.  ©  2002  Elsevier,  d,  Schematic  diagram  of  the  biological  process  of  formation  of  spider  silk  protein  filaments  in  a  spider's  spinning  duct,  involving  a 
controlled  change  of  pH,  solvent  chemicals  and  mechanical  shear,  leading  to  the  formation  of  the  (3-sheet-rich  spider  silk  filaments39,  e,  Synthetic  process 
of  spider  silk  formation,  realized  in  a  microfluidic  device  (left),  together  with  a  micrograph  (right)  of  the  resulting  synthetic  fibres  (scale  bar  10  pm). 
Adapted  and  reprinted  with  permission  from  ref.  39.  ©  2008  National  Academy  of  Sciences  USA. 


functional  structure1  (Fig.  2a).  In  vivo,  many  protein  materials  obtain 
their  specific  functionality  by  post-translational  modifications, 
which  include  hydroxylations  (for  example  during  formation  of 
tropocollagen  s  hydroxyproline  residues),  phosphorylations  and 
glycosylations  (for  example  during  formation  of  cartilage  tissues),  as 
well  as  enzymatic  crosslinking.  These  modifications  are  particularly 
important  for  the  material  properties  of  tissues,  as  they  control  the 
interaction  between  proteins  and  with  other  material  components 
(such  as  inorganic  materials  or  sugar-based  components)  as  well 
as  their  bioactive  properties.  These  modifications  are  particularly 
difficult  to  mimic  ex  vivo  or  through  synthetic  approaches, 


posing  a  challenge  in  the  development  of  biomimicking  and 
biocompatible  materials. 

The  resultant  individual  protein  components  self-assemble  at 
different  timescales  to  form  a  protein  structure  with  hierarchical 
geometrical  entities  (Fig.  la,  b).  Assembly  mechanisms  have  been 
investigated  based  on  the  combination  of  imaging  tools  with 
controlled  assembly  conditions  (pH,  temperature,  solvent).  This 
approach  has  elucidated  the  mechanism  of  lamin  assembly14,37 
(Fig.  2b).  In  this  process,  lamin  dimer  association  and  axial  growth 
are  followed  by  radial  growth  to  form  larger  fibrils,  resulting 
in  formation  of  a  network.  Figure  2b  also  shows  the  direct 
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Figure  3  |  Universality  and  diversity  of  biological  protein  materials,  a,  Degree  of  structural  diversity  and  universality  in  protein  materials,  plotted  against 
biological  length  scales  (equivalent  to  hierarchical  levels).  Inset,  a  visualization  of  the  topoisomerase  protein  (biological  role  is  to  cut  strands  of  the  DNA 
double  helix;  visualization  using  visual  molecular  dynamics140).  This  inset  illustrates  how  universal  motifs  define  the  overall  functional  properties  of  this 
protein,  whereas  the  entire  protein  structure  represents  diversity.  See  also  Table  2.  AH,  a-helix;  BS,  (3-sheet;  RC,  random  coil;  AA,  amino  acid,  b,  Placement  of 
mesoscale  science  of  protein  materials,  between  single-molecule  studies  and  micromechanical  and  continuum-type  biomechanics  applications. 


Table  2  |  Occurrence  of  protocols  in  four  sample  materials,  illustrating  that  some  protocols  are  universal  to  all  protein  materials 
whereas  others  are  specific. 


Protocol 

Biological  protein  material 

Collagen  (tendon) 

Lamin 

Spider  silk 

Amyloids 

DNA  nucleotides*  (ACGT) 

/ 

/ 

/ 

/ 

Vimentin  DNA  sequence1 

/ 

Collagen  DNA  sequence1 

/ 

Spicier  silk  DNA  sequence1 

/ 

Amyloid  DNA  sequence1 

/ 

DNA  double-helical  structure* 

/ 

/ 

/ 

/ 

Protein  building  blocks*  (20  AAs) 

/ 

/ 

/ 

/ 

Motifs*  AH,  BS,  RC 

/ 

/ 

/ 

/ 

Collagen  fibril1 

/ 

Lamin  filaments1 

/ 

(3-Nanocrystal  composite1 

/ 

Cross-(3  fibril  structure1 

/ 

*Universal;  diverse.  AH,  a-helix;  BS,  |3-sheet;  RC,  random  coil;  AA,  amino  acid. 


experimental  visualization  of  these  assembly  processes.  A  snapshot 
of  the  assembly  process  for  collagen  fibrils  is  shown  in  Fig.  2c, 
revealing  the  characteristic  banding  structure  of  collagen  fibrils 
when  multiple  molecules  assemble  into  microfibrils.  The  biological 
process  of  in  vivo  assembly  of  many  structural  protein  materials 
involves  a  dynamic  change  of  physical  and  chemical  conditions,  as 
has  been  shown  for  the  synthesis  of  spider  silk38,39  (Fig.  2d). 

In  addition  to  studies  of  naturally  occurring  biological  protein 
materials,  recent  research  led  to  the  development  of  techniques 
that  enable  one  to  change  their  structural  makeup,  and  to  design 
and  synthesize  synthetic  analogues40-44  through  recombinant  DNA 
techniques,  RNA  interference  knockdowns  or  sequence  insertions. 
Two  primary  routes  of  development  pursued  include  ex  vivo 
assembly  (for  example  self- assembling  peptide  systems40,41,44)  and 
in  vivo  expression  of  protein  materials  (for  example  through  bacterial 
hosts42,45).  The  ability  to  control  the  DNA  sequence  information  at 
a  fundamental  level  provides  us  with  the  ability  to  engineer  the 
structure  of  protein  materials  at  the  molecular  (amino  acid)  scale. 
EAK16,  for  example,  belongs  to  an  interesting  class  of  self- assembled 
peptides  that  constitutes  a  material  platform  for  a  variety  of  biological, 
biomimetic  and  nanotechnology  applications46.  Knowledge  of  the 
details  of  the  in  vivo  processes  enables  one  to  mimic  these  processes 
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ex  vivo ,  as  has  been  demonstrated  for  spider  silk  fibres  in  microfluidic 
devices39  (Fig.  2e). 

The  ability  to  control  self  assembly  synthetically  and  genetically 
provides  a  powerful  framework  for  the  study  of  the  links  between 
structure,  property  and  process.  The  approach  of  adding  and 
deleting  molecular  domains  with  distinct  chemical  functionality 
has  made  possible  the  detailed  study  of  the  mechanisms  of  self- 
assembly  processes  (for  example  identification  of  essential  protein 
domains)47-50.  This  allows  one  to  determine  how  changes  to  the 
protein  structure  alter  biological  function  and  disease  properties. 

The  evolution  of  protein  materials  through  genetic  selection 
and  structural  alterations  has  resulted  in  a  specific  set  of  protein 
building  blocks  that  define  their  structure.  Protein  materials  exist 
in  abundant  variety,  and  the  need  exists  to  formulate  a  widely 
applicable  model  to  categorize  all  such  materials  systematically,  in 
order  to  establish  a  fundamental  understanding,  and  to  make  use  of 
hierarchical  structural  building  blocks  to  develop  a  new  generation 
of  advanced  nanomaterials51-53.  A  protocol  is  defined  here  as  a 
term  that  encompasses  a  general  analysis  of  protein  materials  that 
describes  the  use  of  structural  building  blocks  (for  example  a-helices, 
(3-sheets,  random  coils)  during  their  formation  and  function,  and  the 
process  or  mechanism  of  use  of  this  material  (for  example  synthesis, 
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breakdown,  self  assembly).  The  phenomenon  of  universality  exists 
ubiquitously  in  biology,  where  certain  protocols  are  commonly 
found  in  all  protein  materials  (such  as  the  use  of  hierarchical  levels 
of  building  blocks:  DNA  nucleotides,  DNA  double  helical  structure, 
a-helices,  (3-sheets  and  the  process  of  transcription/translation, 
protein  synthesis).  However,  other  protocols  are  highly  specialized 
(such  as  the  use  of  specific  DNA  sequences  for  a  particular  protein 
structure,  the  resultant  protein  motifs  of  tendon  fascicles,  lattice-like 
lamin  structure,  and  so  on),  thus  representing  diversity.  Protocols 
can  therefore  be  classified  as  either  universal  or  diverse. 

Universal  and  diverse  protocols  are  distributed  heterogeneously 
across  hierarchical  levels  (Fig.  3a  and  Table  2).  The  four  DNA 
nucleotides  (ACGT)  represent  a  universal  protocol  common  to 
all  protein  materials,  where  their  arrangements  in  diverse  patterns 
form  the  immense  variety  of  genetic  sequences  found  in  biology. 
Genetic  sequences  are  universally  encoded  in  a  double-helical  DNA 
structure,  regardless  of  the  specific  nucleotide  sequence.  Through  the 
universal  process  of  transcription  and  translation,  protein  molecules 
are  synthesized  into  a  ID  sequence  of  the  universal  20  amino-acid 
building  blocks.  Virtually  all  proteins  contain  one  or  more  of  these 
universally  found  motifs:  a-helices,  (3-sheets  and  random  coils. 
These  universal  motifs  arrange  into  unique,  diverse  larger-scale 
protein  structures  (for  example  enzymes,  fibres,  filaments).  A  greater 
diversity  of  protocols  is  generally  found  at  higher  levels,  suggesting 
that  biological  functionality  is  associated  with  structural  diversity. 
Universality  is  associated  with  protocols  that  can  be  used  to  derive 
diverse  functionality  at  larger  hierarchical  levels.  A  fundamental 
difference  between  engineered  materials  and  naturally  formed 
biological  materials  is  that  functionality  in  biology  can  be  created  by 
arranging  universal  building  blocks  in  different  patterns,  rather  than  by 
inventing  new  types  of  building  block.  The  formation  of  hierarchical 
arrangements  provides  the  structural  basis  that  allows  the  existence  of 
universality  and  diversity  within  a  single  material.  This  combination 
of  dissimilar  concepts  may  explain  how  protein  materials  are  capable 
of  combining  disparate  material  properties,  such  as  high  strength  and 
high  robustness,  together  with  multifunctionality. 

Biological  functionality  must  be  understood  at  varying  scales. 
Biochemistry  focuses  on  biological  functionality  at  molecular 
scales.  The  mesoscale,  which  encompasses  length  scales  ranging 
from  nanometres  to  micrometres  and  timescales  of  nanoseconds 
to  microseconds,  is  a  particularly  important  level  necessary  to 
understand  how  specific  protein  materials  derive  their  unique 
properties  and  what  role  they  play  (in  both  physiological  and 
pathological  phenomena;  Fig.  3b  ).  The  mesoscale  science  of  protein 
materials,  through  the  linking  of  molecular  properties  to  properties 
of  protein  materials  at  the  microscale,  thus  represents  an  important 
frontier  of  materials  research  with  great  potential  for  fundamental 
contributions  to  biology  and  medicine  as  well  as  for  the  de  novo 
synthesis  of  engineered  materials. 

The  approach  of  making  use  of  universal  building  blocks  to  create 
diverse  multifunctional  hierarchical  structures  has  been  successfully 
applied  in  current  macroscale  engineering  paradigms.  For  example, 
in  the  design  of  structures  such  as  buildings  or  bridges,  universal 
constituents  (bricks,  cement,  steel  trusses,  glass)  are  combined  to 
create  multifunctionality  (structural  support,  living  space,  thermal 
properties,  light  harvesting)  at  larger  length  scales.  The  challenge  of 
using  similar  concepts  that  span  to  the  nanoscale,  as  exemplified  in 
biological  protein  materials  by  merging  structure  and  material,  could 
lead  to  the  emergence  of  new  technological  concepts.  A  key  obstacle 
in  the  development  of  new  materials  lies  in  our  inability  to  control 
the  structure  formation  directly  at  multiple  hierarchical  levels.  The 
concept  of  universality  and  diversity  and  the  knowledge  gained 
from  how  to  characterize  these  materials  at  different  hierarchical 
levels  may  help  in  addressing  these  challenges. 

Natures  use  of  a  limited  number  of  universal  building  blocks, 
arranged  in  a  variety  of  ways,  is  a  limitation  as  well  as  a  strength 
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of  biological  systems.  For  example,  although  the  performance  of 
structural  tissues  in  our  body  is  poor  compared  with  most  engineered 
materials  (such  as  steel  or  concrete),  their  performance  is  remarkably 
good  considering  their  inferior  building  blocks.  Understanding  these 
material  concepts  and  their  translation  to  the  design  of  synthetic 
materials  could  provide  us  with  new  concepts  for  materials  design  based 
on  inexpensive,  abundant  and  environmentally  benign  constituents. 

Multiscale  failure  mechanisms  of  protein  materials 

The  stability  and  failure  mechanisms  of  biological  protein  materials 
play  an  integral  part  in  defining  their  functional  role  in  biological 
systems.  In  this  context,  a  bottom-up  description  must  begin  with 
the  study  of  the  properties  of  individual  protein  molecules.  A 
variety  of  experimental,  theoretical  and  computational  tools  have 
been  developed  to  assess  structure-process-property  relations  and 
to  monitor  and  predict  materials  failure  mechanisms. 

In  biological  protein  materials,  each  hierarchical  level  has  its 
own  defects  and  failure  mechanism.  Over  multiple  hierarchical 
levels,  there  are  two  primary  types  of  response.  First,  if  a  material 
is  able  to  tolerate  defects  and  failure  at  specific  scales,  it  retains 
its  overall  integrity  because  the  presence  of  multiple  hierarchical 
levels  decreases  the  overall  probability  of  catastrophic  failure. 
Second,  some  defects  and  failure  mechanisms  lead  to  a  catastrophic 
breakdown  of  the  material  (for  example  in  genetic  disease,  where 
proteins  are  misfolded  owing  to  errors  in  the  DNA  code).  Many 
physiological  conditions  exist  in  the  realm  where  the  material  is 
able  to  tolerate  defects  and  failure  mechanisms  (such  as  microcracks 
in  bone36),  and  many  diseases  originate  because  of  the  materials 
inability  to  mitigate  material  defects  and  breakdown.  In  both  cases, 
the  basic  mechanisms  of  failure  can  be  evident  through  the  breaking 
of  hydrogen  bonds,  protein  unfolding,  sliding  of  molecules  against 
each  other  and  breaking  of  crosslinks.  Failure  occurs  because  the 
intramolecular  bonding  (for  example  hydrogen  bonds)  is  crucial  to 
defining  the  structure  of  protein  building  blocks.  Similarly,  adhesion 
forces  due  to  a  charged  surface  or  hydrophobic  forces  between 
molecules,  fibrils,  fibres  and  filaments  are  elementary  to  providing 
function  at  larger  length  scales.  Often,  these  failure  mechanisms  are 
competing  and  depend  on  loading  rate,  pH,  density  of  crosslinks 
or  molecular  geometry34,54-56.  These  biological  failure  mechanisms 
can  be  analogous  to  failure  in  engineering  materials,  which  is 
mediated  by  defects  such  as  cracks,  dislocations  or  mass  transport 
along  grain  boundaries.  The  quantitative  analysis  of  multiscale 
failure  mechanisms  provides  great  potential  in  explaining  the  role 
of  materials  in  their  respective  biological  context. 

Advances  in  experimental  techniques  have  allowed  the  structural 
characterization  of  protein  materials  across  multiple  scales 
(Fig.  4).  Techniques  such  as  nuclear  magnetic  resonance  (NMR) 
spectroscopy  and  X-ray  crystallography  have  advanced  our  ability  to 
identify  3D  protein  structures57.  Site-specific  studies  using  NMR,  a 
technique  that  allows  internal  molecular  motions  to  be  probed  while 
spanning  a  range  of  timescales  with  precise  spatial  resolution,  have 
provided  insight  into  the  dynamics  of  large  protein  complexes58.  The 
analysis  of  molecular-scale  deformation  mechanisms  has  also  been 
achieved  through  use  of  X-ray  crystallography,  providing  structural 
and  temporal  information  about  mechanisms  of  deformation 
and  assembly  (for  example  in  intermediate  filaments,  tendon  or 
bone)25,32,50,59.  The  development  of  the  Protein  Data  Bank  in  the 
1970s  enhanced  this  progression,  serving  as  a  public  repository  for 
tens  of  thousands  of  3D  atomistic  protein  structures,  identifying 
the  structure  of  numerous  proteins  from  varying  species  sources60. 
X-ray  tomography  provides  insight  into  the  internal  structure  of 
a  wide  range  of  length  scales  encountered  in  protein  materials. 
The  high  penetrating  power  of  X-ray  tomography,  coupled  with  a 
near  absence  of  reflection  at  the  interface  of  dissimilar  materials, 
makes  this  an  ideal  probe  for  reconstructing  the  morphology 
and  composition  of  proteins  at  high  resolutions  (up  to  50  nm)61. 
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Figure  4  |  Tools  for  characterization  and  modelling  of  deformation  and  failure  of  biological  protein  materials.  Respective  timescale  and  length-scale 
domains  of  applicability  are  shown.  Experimental  methods  include  X-ray  diffraction,  transmission  electron  microscopy,  atomic  force  microscopy  (AFM), 
optical/magnetic  tweezers;  microelectromechanical  systems  (MEMS)  testing  and  nanoindentation.  Theoretical  and  simulation  tools  include  quantum 
mechanics/density  functional  theory  (QM/DFT),  molecular  dynamics  (MD),  coarse-grained  models  and  mesoscale  atomistically  informed  continuum 
theories,  as  well  as  continuum  models.  The  lower  part  of  the  figure  indicates  classes  or  scales  of  protein  materials  that  can  be  studied  with  these  types  of 
technique,  a,  Coarse-graining  process  in  the  development  of  mesoscale  models,  here  illustrating  the  representation  of  the  full-atomistic  description  of  a 
tropocollagen  molecule  by  a  collection  of  'super-atoms'  or  beads,  reaching  length  scales  and  timescales  several  orders  larger,  b,  Deformation  of  a  lamin 
intermediate  filament  (LIF)  using  AFM.  Adapted  and  reprinted  with  permission  from  ref.  141.  ©  2007  NPG.  c,  Tensile  stretching  experiment  of  a  collagen 
fibril  with  a  MEMS  device  (scale  bar  2  pm).  Reprinted  with  permission  from  ref.  82.  ©  2006  Royal  Society. 


Mass  spectroscopy  provides  information  about  the  chemical 
content  of  protein  materials.  Fourier-transform  infrared  (FTIR) 
spectroscopy  is  a  related  approach  that  elucidates  information  about 
chemical  bonding  and  is  an  established  tool  used  for  the  structural 
characterization  of  proteins62,63.  Developments  in  single-molecule 
force  spectroscopy  have  made  it  possible  to  study  chemical, 
mechanical  and  structural  properties  using  manipulation  techniques 
such  as  atomic  force  microscopy  (AFM)  and  optical/magnetic 
tweezers20,64  72.  The  force-measuring  capability  of  AFM  also  makes  it 
possible  to  elucidate  molecular  determinants  of  mechanical  stability 
and  the  role  of  force -induced  conformational  changes  of  proteins 
in  the  regulation  of  physiological  function73,74,  and  it  has  been  used 
to  image  the  surface  structure  and  assembly  patterns  (for  example 
in  bone  and  collagen72,75-77).  AFM  has  also  been  effectively  used  in 
combination  with  fluorescent  techniques  for  imaging,  as  reported 
in  studies  of  fibrin  stretching6.  In  a  recent  study,  in  situ  fluorescent 
labelling  was  used  to  visualize  unfolding  of  cytoskeletal  proteins  in 
cells  under  different  mechanical  loading  conditions78,  providing  a 


quantitative  link  between  cellular- scale  deformation  and  underlying 
molecular- scale  rupture  mechanisms.  The  basic  concept  of  this 
approach  is  that  protein  unfolding  due  to  rupture  leads  to  the 
exposure  of  binding  sites,  which  can  be  used  to  bind  fluorescent 
markers  for  imaging.  This  technique  has  shown  that  protein  rupture 
occurs  as  an  underlying  mechanism  in  cells  under  mechanical 
load78.  Nanoindentation  is  another  method  used  to  probe  the 
mechanical  response  of  protein  material  composition  and  structure 
at  multiple  length  scales79,  and  has  allowed  the  characterization  of 
multiscale  mechanical  behaviour,  as  demonstrated  for  bone80,81.  The 
development  of  devices  based  on  microelectromechanical  systems 
(MEMS)  provides  the  ability  to  carry  out  tensile  tests  of  micrometre¬ 
sized  samples,  as  demonstrated  for  individual  collagen  fibrils82.  The 
available  experimental  methods  cover  a  broad  range  of  timescales 
and  length  scales.  Remaining  challenges  include  specificity  with 
respect  to  the  handling  of  individual  molecules  and  filaments, 
their  control  at  very  short  timescales,  and  the  ability  to  observe 
mechanisms  directly  at  molecular  scales. 
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Figure  5  |  Chemomechanical  behaviour  of  protein  constituents. 

a,  Schematic  diagram  of  the  energy-distance  curve  of  a  protein  domain 
transitioning  from  the  folded  to  the  unfolded  state.  The  inset  depicts  this 
process  during  unfolding  of  a  convolution  in  an  a-helix  protein  domain. 
State  I  corresponds  to  the  case  when  donor-acceptor  pairs  form  hydrogen 
bonds  (H-bonds)  in  an  a-helix  convolution.  The  dotted  line  represents  a 
second  state  II,  corresponding  to  the  unfolded  configuration  of  an  a-helix 
convolution  when  hydrogen  bonds  are  broken.  The  transition  between  the 
two  local  minima  occurs  through  an  activated  state  l-ll  in  a  3D  deformation 
path  (timescale  of  rupture  is  approximately  60  ps,  where  state  l-ll  is 
reached  after  20  ps  and  state  II  after  another  40  ps;  the  concurrent 
breaking  of  all  three  to  four  hydrogen  bonds  within  a  convolution  of  the 
a-helix  represents  the  basic  mechanism  of  rupture)55,  b,  Mechanism  of 
deformation  of  a  single  tropocollagen  molecule  under  tension,  showing 
the  dynamics  of  hydrogen-bond  breaking  under  stretching  of  a  single 
tropocollagen  molecule  (hydrogen  bonds  break  at  a  rate  of  two  hydrogen 
bonds  for  every  1%  increase  in  strain)86,  c,  Resulting  effective  energy 
landscape  for  such  a  process,  showing  how  the  energy  barrier  is  reduced 
because  of  a  laterally  applied  force  F  (the  inset  displays  the  loading 
condition  of  an  a-helix  protein  domain  under  tensile  loading),  d,  Strength 
properties  of  several  protein  domains  as  a  function  of  deformation  speed 
(a-helices  and  (3-sheets,  data  plotted  from  refs  55,67,70,142).  Faster 
deformation  (that  is,  short  timescales)  generally  leads  to  an  increased 
rupture  force.  The  dotted  lines  are  included  to  guide  the  eye;  they 
approximate  regimes  of  straight  lines  in  the  plot  of  F  against  ln(v). 


Complementing  experimental  approaches,  atomistic  simulation 
methods  provide  a  bottom-up  description  of  materials  based 
on  the  dynamical  trajectory  of  each  atom,  by  considering  their 
atomic  interactions.  Molecular  dynamics  (commonly  abbreviated 
as  MD)  solves  each  atoms  equation  of  motion83.  In  classical 
molecular  dynamics,  the  3D  structure  of  an  atom  is  approximated 
by  a  point  particle.  The  numerical  integration  of  Newtons  law 
(F  =  ma ),  by  considering  appropriate  force  fields,  enables  one  to 
simulate  a  large  ensemble  of  atoms  that  represents  a  larger  material 
volume.  The  atomic  interactions  are  described  by  decomposing 
the  behaviour  of  chemical  bonds  into  stretching,  bending  or 
rotation  contributions83-85.  Because  of  computational  limitations, 
atomistic-based  simulation  approaches  are  typically  limited  to 
billions  of  particles  (that  is,  length  scales  below  micrometres)  and 
timescales  of  less  than  a  microsecond. 

Despite  the  shortcomings  with  respect  to  accessible  timescales 
and  length  scales,  the  uniqueness  of  atomistic  methods  is  the  ability 
to  directly  visualize  the  dynamics  of  deformation  and  failure.  For 


example,  results  of  molecular  dynamics  simulations  elucidated 
mechanisms  of  hydrogen-bond  rupture  in  protein  domains, 
as  shown  in  Fig.  5a  (inset)55.  Similar  hydrogen-bond  failure 
mechanisms  have  been  observed  in  other  proteins,  such  as  (3-sheet 
crystals55.  Studies  analysing  the  deformation  and  failure  mechanisms 
of  tropocollagen  molecules  found  that  initial  molecular  rotation  is 
followed  by  hydrogen-bond  breaking  and  subsequent  stretching 
of  the  proteins  polypeptide  backbone,  followed  by  covalent  bond 
rupture,  explaining  collagens  characteristic  stiffening31,86  (Fig.  5b). 
Molecular  dynamics  simulations  have  also  been  useful  in  defining 
experimental  protocols  for  in  situ  fluorescent  labelling  by  identifying 
key  rupture  mechanisms  and  exposing  domains78. 

Larger  length  scales  and  timescales  can  be  reached  through 
the  use  of  a  hierarchy  of  simulation  techniques,  integrated 
through  multiscale  methods  (Fig.  4).  These  methods  are  based 
on  the  concept  of  informing  coarser  scales  from  finer  scales, 
enabling  one  to  establish  direct  links  between  chemical  structure 
and  larger  scales.  In  multiscale  approaches,  groups  of  atoms  in 
protein  structures  are  often  represented  by  ‘super- atoms’  or  beads, 
thereby  effectively  reducing  the  degrees  of  freedom  (a  coarse- 
graining  approach  for  tropocollagen  is  shown  in  Fig.  4a).  The 
direct  comparison  between  molecular  dynamics  simulation  (in 
particular  coarse-graining  methods)  and  experiment  has  become 
more  viable  owing  to  the  emergence  of  experimental  tools  that 
reach  ever  smaller  scales. 

At  molecular  and  submolecular  scales,  an  integrated  chemo¬ 
mechanical  approach  is  mandatory  to  describe  the  failure 
properties  of  biological  protein  materials.  Theoretical  approaches 
that  link  chemistry  and  mechanics  are  often  derived  from  a 
phenomenological  theory  originally  postulated  by  Bell  more  than 
30  years  ago87-90.  Several  sophisticated,  advanced  models  are  now 
available91-96  and  the  ‘original’  Bell  model  is  less  often  used  in  the 
analysis  of  protein  rupture.  However,  the  ideas  behind  Bell’s  model 
provide  the  intellectual  foundation  for  many  theories.  The  basic 
concept  of  the  interplay  of  applied  forces  and  the  resulting  energy 
landscape  change  is  illustrated  in  Fig.  5c.  The  Bell  model  is  based 
on  a  statistical  approach  to  describe  the  likelihood  of  bond  rupture 
under  the  presence  of  an  energy  barrier  £b,  which  is  reduced  due 
to  an  applied  force  F  at  the  failure  point  xh.  The  probability  for  a 
bond  to  break  is  expressed  using  an  Arrhenius  expression,  which 
predicts  that  the  lifetime  of  a  bond  decreases  exponentially  as  the 
energy  barrier  decreases  under  the  applied  force.  Expressions  that 
relate  the  failure  force  F  to  the  pulling  rate  v  applied  to  a  protein 
predict  a  characteristic  logarithmic  dependence  of  the  rupture 
force  as  a  function  of  timescales  or  pulling  rate  v,  resulting  in 
a  dependence  of  the  form  F(v;  xh  Eh)  =  a(xh,T)  ln(v)  +  b(xh,  Eh,T) 
(where  a  and  b  are  parameters  that  depend  on  the  energy  landscape 
and  temperature  T). 

Figure  5d  displays  a  set  of  experimental  and  computational 
results,  illustrating  this  type  of  relationship.  Each  straight  line 
reflects  a  particular  rupture  mechanism,  characterized  by  an  energy 
barrier  Eh  and  a  transition  state  distance  xh.  Variations  of  pulling 
rates  often  induce  a  change  in  failure  mechanism.  The  analysis 
of  the  results  of  the  pulling  rate  dependence  of  forces  allows  the 
interpretation  of  experimental  and  computational  results  through 
the  determination  of  effective  energy  barriers  and  transition  states, 
and  provides  insight  into  microscopic  failure  mechanisms. 

The  original  Bell  model  does  not  provide  a  direct  link  between  the 
atomistic  chemical  structure  of  a  bond  (or  arrangements  of  bonds) 
and  the  resulting  behaviour.  For  instance,  the  effective  energy  barrier 
shown  in  Fig.  5c  may  represent  a  cluster  of  several  weak  bonds  that 
break  simultaneously,  or  a  single,  stronger  bond.  The  model  further 
does  not  capture  the  effect  that  the  energy  landscape  itself  may 
change  as  the  protein  deforms.  Other  models  have  been  reported 
that  describe  the  strength  of  proteins  in  dependence  on  the  force 
rate  (the  rate  of  increase  of  force  over  time)91-96.  Some  models  also 


182 


NATURE  MATERIALS  |  VOL  8  |  MARCH  2009  |  www.nature.com/naturematerials 


NATURE  MATERIALS  DOI:  10.1038/NMAT2387 


REVIEW  ARTICLE 


Figure  6  |  Integration  of  theory,  simulation  and  experimental  techniques,  a,  Results  of  an  elastic  network  model  calculation,  used  to  describe  the 
anisotropic  failure  properties  of  a  larger  protein  structure.  Reprinted  with  permission  from  ref.  98.  ©  2008  APS.  b,  Entropic  elastic  behaviour  of  stretching  a 
single  tropocollagen  (TC)  molecule,  comparing  experiment  (optical  tweezers),  coarse-grained  mesoscale  simulation  (molecular  dynamics,  MD)  and  theory 
(worm-like-chain  model,  WLC).  Data  replotted  from  refs  31  and  103;  x/i  is  the  ratio  of  extension  x  to  contour  length  L  c,  Young's  modulus  prediction  of  a 
single  tropocollagen  molecule.  The  graph  shows  a  comparison  between  experimental  and  full-atomistic  molecular  dynamics  simulation  results  (based  on 
averages  from  literature  sources  summarized  in  ref.  86;  error  bars  correspond  to  standard  deviation),  d,  Comparison  between  experimental  and  theoretical 
modelling  of  stretching  a  long  titin  filament,  obtained  through  the  application  of  a  theoretical  approach  that  involves  the  Bell  model  and  a  worm-like-chain 
description  for  protein  elasticity.  Reprinted  with  permission  from  ref.  104.  ©  1998  APS. 


include  a  description  of  the  transition  between  multiple  transition 
states,  effectively  describing  the  occurrence  of  several  straight  lines 
in  the  F-ln(v)  domain91-96  (similar  to  the  overall  behaviour  shown 
in  Fig.  5d). 

Challenges  in  linking  simulation  and  experiment  include 
differences  in  accessible  timescales  and  length  scales.  For  example, 
the  computational  prediction  of  the  3D  folded  structure  of  proteins 
directly  from  the  amino  acid  sequence  is,  despite  advances  in  recent 
years,  still  at  a  relatively  early  stage  and  can  typically  only  be  applied 
reliably  to  simpler  protein  structures,  as  the  timescales  for  protein 
folding  are  beyond  current  computational  capabilities97. 

Because  biological  protein  materials  feature  properties  that 
are  strongly  length-scale  and  timescale  dependent,  the  explicit 
consideration  of  particular  scales  for  the  quantitative  comparison 
of  measurements  and  theory  is  essential.  Measurements  at  different 
timescales  and  length  scales  lead  to  significant  variations  in  material 
properties.  For  timescales,  this  can  be  observed  in  the  results  of 
strength  properties  reviewed  in  Fig.  5d55,67,70.  This  plot  explains  why 
many  molecular  dynamics  simulations  predict  a  vastly  different 
behaviour  from  experimental  studies,  because  they  are  carried  out 
at  different  timescales.  An  implication  of  these  strong  size  effects 
is  that  one  cannot  directly  use  single-molecule  measurements  to 
infer  larger-scale  properties,  and  that  homogenization  approaches 
commonly  used  for  crystalline  materials  cannot  be  applied  directly. 
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A  multiscale  analysis  that  explicitly  calculates  or  measures  material 
properties  at  multiple  scales  is  compulsory. 

Despite  these  challenges,  several  studies  have  led  to  direct 
comparison  between  experiment  and  theory.  Protein  strength 
models  have  been  extended  to  more  complex  protein  geometries, 
a  vital  step  to  arrive  at  structure-process-property  relationships  of 
strength  properties98-100.  Figure  6a  shows  an  elastic  bond  network 
model  for  protein  unfolding  mechanics,  which  combines  an  elastic 
model  of  a  network  of  bonds  with  a  bond  fracture  model  based 
on  Bells  concept98.  The  model,  here  applied  to  green  fluorescent 
protein,  treats  bonds  as  springs  and  provides  a  direct  structure- 
property  relationship  between  the  geometry  and  bond  rupture 
(for  example  the  proteins  mechanical  anisotropy).  It  is  capable  of 
accurately  reproducing  experimental  results  reported  earlier  for  the 
same  protein  structure101  (for  comparison  see  Fig.  6a). 

A  major  success  is  the  use  of  the  worm -like -chain  model  to 
describe  entropic  elasticity69,102.  The  use  of  Bell-type  strength 
models  in  conjunction  with  worm -like -chain  models  resulted  in 
quantitative  agreement  between  theory  and  experiment.  Figure  6b 
depicts  the  comparison  of  an  optical  tweezers  experiment  of 
stretching  a  single  tropocollagen  molecule  with  results  from 
molecular  dynamics  simulation  (based  on  the  mesoscale 
coarse-grained  model  shown  in  Fig.  4a)  and  the  theoretical 
worm -like -chain  model31,102,103.  All  three  approaches  provide  a 
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consistent  description  of  the  protein’s  entropic  elastic  behaviour. 
Measurements  and  theoretical  predictions  of  the  stiffness  of  single 
tropocollagen  molecules  at  larger  deformation  (when  hydrogen 
bonds  within  the  molecules  are  being  stretched)  are  compared  in 
Fig.  6c,  revealing  good  agreement  between  various  experimental 
and  molecular  dynamics  simulation  approaches.  Under  large 
forces  and  displacements,  the  worm -like -chain  model  fails  and 
rupture  of  hydrogen  bonds  within  protein  domains  dominates 
(see  Fig.  5a,  b)69.  This  behaviour  has  been  captured  in  quantitative 
models  that  combine  the  worm -like -chain  model  with  the  Bell 
model  as  reported  previously104,105.  Figure  6d  shows  a  comparison 
between  experimental  and  theoretical  results  based  on  an 
elastically  coupled  protein  model  that  predicts  elastic  and  strength 
properties104.  The  theoretical  prediction  and  experimental  results 
are  in  good  agreement,  suggesting  that  this  model  can  accurately 
reflect  the  interplay  of  strength  and  elasticity. 

A  recent  paper  further  illustrates  the  integration  of  experiment 
and  theory,  where  measurements  of  molecular  rupture  forces 
between  single  actin  filaments  and  two  actin-binding  proteins 
provide  key  insight  into  the  competition  of  rupture  mechanisms 
under  different  pulling  rates54.  It  was  observed  that  whereas  the 
breaking  of  a-actinin/actin  clusters  dominates  at  slow  pulling  rates, 
the  breaking  of  filamin/actin  clusters  governs  at  higher  pulling  rates. 
This  quantitative  understanding  of  competing  mechanisms  is  a 
crucial  element  in  the  development  of  failure  models  of  soft  tissues. 
Studies  of  fibrin  provide  another  example  of  a  protein  material 
where  experiment,  theory  and  simulation  have  been  effectively 
combined  to  improve  our  understanding  of  its  material  properties6,7. 
The  initial  observation  of  extreme  extensibility  of  fibrin  (>300% 
strain)6  motivated  single -molecule  studies8  as  well  as  molecular 
dynamics  simulations9,  which  suggested  that  the  particular  coiled- 
coil  structure  of  this  protein  provides  the  structural  basis  for  fibrins 
great  extensibility. 

Role  of  protein  materials  in  disease 

Throughout  the  past  century,  the  focus  on  addressing  diseases 
has  derived  primarily  from  a  biochemical  approach.  However, 
advancements  and  increased  understanding  have  yielded  a  greater 
appreciation  of  the  role  of  materials  science  of  protein  materials  in 
various  medical  disorders.  In  this  section,  the  role  of  biologically 
relevant  material  properties  in  the  progression  or  activation  of 
diseased  states  will  be  discussed. 

Alzheimer’s  disease,  Parkinson’s  disease,  type  II  diabetes  and 
prion  diseases106-109  have  been  linked  to  the  formation  of  foreign 
material  deposits  in  tissues.  These  material  deposits,  referred  to 
as  amyloid  plaques,  are  highly  ordered  hierarchical  assemblies  of 
(3-sheet  protein  domains  that  form  spontaneously  (Fig.  7a)110-113. 
Once  formation  is  initiated,  amyloid  plaques  grow  uncontrollably 
to  length  scales  of  micrometres  and  persist  under  a  wide  range  of 
pH  conditions.  The  mechanical  robustness  of  these  plaques  has 
been  attributed  to  the  large  number  of  hydrogen  bonds  as  well  as  to 
steric  and  hydrophobic  interactions  between  different  parts  of  the 
(3-sheet  structures110,114,115,  although  the  exact  structure-process- 
property  relationships  for  this  material  remain  to  be  investigated. 
The  properties  of  amyloid  fibrils  have  been  probed  using  AFM 
techniques  and  molecular  dynamics  simulations55,68,111,116,117.  It  is 
understood  that  the  formation  of  amyloid  plaque  deposits  leads 
to  neurotoxicity,  which  interferes  with  the  biological  function 
of  the  native  tissue108,109.  The  mechanical  robustness  of  amyloid 
plaques  and  the  body’s  failure  to  eliminate  these  material  deposits 
remain  the  primary  reasons  for  our  inability  to  reverse  the 
progression  of  this  disease.  An  improved  understanding  of  how 
the  hierarchical  structure  of  amyloid  plaques  contributes  to  their 
extreme  mechanical  stability  could  lead  to  new  strategies  for 
treatment  through  targeting  selective  breakdown  of  these  material 
deposits  in  situ. 
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Hutchinson-Gilford  progeria  syndrome,  a  genetic  rapid  ageing 
disease,  is  caused  by  a  structural  defect  in  the  lamin  nuclear 
membrane22,118-120.  The  progression  of  this  disease  has  been 
associated  with  mechanical  failure  of  the  cell’s  nuclear  membrane 
in  tissues  subject  to  mechanical  loading.  A  recent  study  based 
on  live-cell  imaging  and  micropipette  aspiration  has  shown  that 
progeria  nuclear  membranes  display  a  reduced  deformability  and 
feature  the  formation  of  fractures  on  application  of  mechanical 
load22.  The  cause  of  these  fractures  is  attributed  to  changes  in  the 
lamin  microstructure,  where  filaments  form  more  ordered  domains 
that  prevent  the  dissipation  of  mechanical  stress.  This  structural 
alteration  leads  to  a  change  in  the  deformation  mechanism,  from 
a  dissipative  mode  (‘ductile’)  in  healthy  cells  to  a  catastrophic, 
localized  failure  mode  (‘brittle’)  in  diseased  cells  (Fig.  7b,  c).  These 
mechanisms  appear  predominantly  in  cells  that  are  subjected  to 
mechanical  deformation,  particularly  in  endothelial  and  smooth 
muscle  cells  of  the  vascular  system.  Perhaps  a  loss  of  mechanical 
integrity  in  the  cell’s  nuclear  membrane  can  influence  gene  regulation 
by  triggering  a  wide  range  of  biochemical  processes  that  lead  to  the 
rapid  ageing  phenomenon.  However,  the  exact  molecular  failure 
mechanisms  remain  unknown  and  their  investigation  represents  an 
opportunity  for  future  research  where  a  materials  science  approach 
could  make  an  important  contribution.  This  disease  illustrates  how 
material  failure  due  to  structural  flaws  within  a  protein  material 
can  lead  to  the  breakdown  of  critical  biological  components.  Many 
other  genetic  diseases  resulting  from  structural  flaws  in  the  lamin 
protein  network  have  been  identified,  generally  referred  to  as 
Taminopathies’118-120. 

Other  genetic  disorders  in  collagenous  tissues  have  been  linked 
to  the  alteration  of  the  material  structure  owing  to  mutations  in 
the  genes  that  encode  the  tropocollagen  molecule.  Osteogenesis 
imperfecta  is  a  genetic  disease  that  increases  the  susceptibility  of 
bones  to  catastrophic  brittle  fracture.  The  origin  of  this  disease 
resides  in  changes  to  the  structure  of  tropocollagen  molecules  caused 
by  the  substitution  of  a  single  glycine  amino  acid  (Fig.  7d)121,122. 
Some  collagen  mutations  prevent  the  formation  of  triple  helical 
molecules  (procollagen  suicide),  whereas  other  mutations  cause 
structural  changes  to  tropocollagen  molecules,  leading  to  bending 
(caused,  for  example,  by  kinks  induced  by  amino  acid  substitutions), 
reduced  mechanical  stiffness  (caused,  for  example,  by  changes  in 
the  volume  and  hydrophobicity),  or  changes  in  the  intermolecular 
adhesion  (caused,  for  example,  by  changes  in  surface  charges)121-123. 
At  mesoscopic  length  scales,  these  molecular-level  changes  lead  to 
poor  fibril  packing124,125  and  a  decrease  in  crosslink  density125,126. 
Changes  in  the  size  and  shape  of  mineral  crystals  in  bone  (for 
example  less  organized,  more  rounded  crystals)  have  also  been 
reported127-129  and  are  possibly  due  to  a  change  in  the  ability  of 
tropocollagen  to  bind  to  the  mineral  phase  of  bone130,131.  At  larger 
length  scales,  the  effects  of  osteogenesis  imperfecta  mutations  lead 
to  inferior  mechanical  properties  of  tendon  and  bone132  (Fig.  7d).  A 
mechanically  inferior  collagen  matrix  in  addition  to  an  increased 
and  less  organized  mineral  content,  and  an  overall  reduced  bone 
volume  caused  by  reduced  bone  turnover133,  may  explain  the 
phenomenon  of  brittle  bones,  an  important  feature  of  osteogenesis 
imperfecta125.  A  related  disease  in  which  collagen  mutations  play  a 
key  role  is  Alport’s  syndrome,  in  which  a  mutation  in  the  collagen 
gene  alters  the  glomerular  basement  membrane  of  kidneys134.  The 
progression  of  mechanical  failure  of  the  glomerular  basement 
membrane  eventually  leads  to  the  kidney’s  inability  to  filter  blood, 
resulting  in  renal  failure. 

The  change  of  cell  stiffness  in  cancer  cells  has  been  shown 
by  using  a  microfluidic  optical  cell  stretcher135.  Studies  of  AFM 
indentation  of  cells  confirmed  that  cancer  cells  display  a  reduced 
stiffness  compared  with  that  of  healthy  cells136  (Fig.  7e).  This 
mechanical  signature  of  cancer  cells  could  perhaps  be  used 
to  define  new  diagnostic  approaches  in  cancer  detection.  The 
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Figure  7  |  Role  of  changes  in  biological  protein  material  properties  in  diseases,  a,  Structure  of  amyloids,  spanning  from  the  chemical  structure 
of  individual  hydrogen  bonds  to  the  scale  of  amyloid  plaques.  Included  is  a  visualization  of  molecular  models  proposed  for  amyloid  fibrils  ('cross- 
(3-structure'  and  '(3-helix').  Protofilament  structure  reprinted  with  permission  from  ref.  143.  ©  NPG.  b,  Effect  of  lamin  mutations  in  progeria  on  the 
mechanical  properties  of  the  cell  nuclear  membrane  (scale  bar  5  pm,  fluorescent  imaging).  The  cell  nuclear  membrane  forms  localized  fractures  under 
large  mechanical  loading,  c,  Change  in  the  lamin  microstructure  due  to  genetic  mutation,  and  effect  on  deformation  mechanism  (b  and  c  reprinted 
with  permission  from  ref.  22.  ©  2006  National  Academy  of  Sciences  USA),  d,  Geometry  of  point  mutations  in  osteogenesis  imperfecta  (01)  in  the 
tropocollagen  molecule  (upper  part),  and  effect  on  mechanical  properties  of  tendon,  where  the  maximum  strain  and  maximum  stress  of  tendon  are 
severely  reduced  under  mutations  (lower  part,  data  plotted  from  ref.  132).  e,  Mechanical  signature  of  cancer  versus  normal  cells,  revealing  that  cancer 
cells  are  softer  than  normal  cells.  Adapted  and  reprinted  with  permission  from  ref.  136.  ©  1997  NPG. 


approach  based  on  microfluidic  stretching135  may  provide  an 
effective  high-throughput  platform  for  cancer  diagnosis.  Studies  of 
cell  mechanics  using  optical  tweezers  have  revealed  that  malaria- 
infected  cells  become  stiffer,  preventing  normal  blood  flow 
through  lung  capillaries137.  The  elastic  cellular  microenvironment 
(for  example  the  stiffness  of  collagen-based  extracellular  matrix) 
has  been  shown  to  control  stem  cell  differentiation  into  neuron, 
muscle  or  bone  cells138. 

These  case  studies  illustrate  that  the  change  of  material  properties 
is  a  crucial  element  in  many  diseases.  Translation  of  this  knowledge 
would  allow  detection  of  diseases  by  measuring  material  properties 
rather  than  by  focusing  on  symptomatic  biochemical  readings 
alone.  Altogether,  understanding  the  role  of  different  hierarchical 
levels  of  protein  materials  in  diseases  could  potentially  bring  about  a 
new  paradigm  of  approaches  to  address  medical  disorders;  however, 
further  research  is  needed  to  elucidate  the  underlying  multiscale 
failure  mechanisms. 
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Future  directions 

Along  with  biomedical  applications,  understanding  of  the  material 
concepts  in  protein  materials  could  also  be  used  to  advance 
nanomaterials  for  engineering  applications.  Material  properties  of 
nanomaterials  are  often  found  to  be  superior  to  those  of  conventional 
engineering  materials.  For  instance,  carbon  nanotubes  are  one  of  the 
strongest  materials  known  but  have  rarely  been  used  in  structural 
materials  because  of  our  inability  to  use  their  properties  at  larger 
length  scales.  Understanding  protein  materials  could  provide 
knowledge  that  would  eventually  be  translated  to  connecting 
disparate  material  scales,  from  nanoscales  to  macroscales.  We  may 
then  be  able  to  apply  these  material  concepts  to  make  use  of  the 
full  potential  (higher  strength,  higher  robustness)  of  multiscale 
engineered  materials.  An  intriguing  application  may  be  the 
development  of  a  new  material  platform  based  on  concepts  similar 
to  those  found  in  biological  protein  materials.  However,  rather  than 
using  the  universal  protein  constituents  (such  as  a-helices,  (3-sheets, 
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random  coils  or  tropocollagen),  we  could  instead  use  building 
blocks  such  as  carbon  nanotubes,  nanowires  and  self-assembled 
peptides.  Some  steps  towards  this  goal  have  been  taken  in  recent 
studies,  where  conducting  metallic  nanowires  were  grown  through 
the  use  of  amyloids  as  templates139. 

Biology  makes  use  of  hierarchical  structures  in  an  intriguing 
way  to  create  multifunctional  materials.  Even  though  biochemical 
and  image-based  diagnostics  will  remain  important,  the  integration 
of  scales,  as  well  as  the  mixing  of  physical,  biological  and  chemical 
concepts  into  new  engineering  designs,  could  complement  the 
current  practice  of  disease  diagnosis  and  treatment,  as  well  as  the 
design  of  new  materials,  and  thereby  unfold  many  opportunities  for 
technological  innovations.  The  wide  impact  of  the  use  of  materials 
science  approaches  in  biology  and  biomedical  sciences  has  yet  to 
be  demonstrated.  In  some  specific  areas  such  as  bone  (for  example 
osteoporosis),  cartilage  (for  example  arthritis),  cardiovascular 
aspects  and,  particularly,  in  the  context  of  tissue  engineering  and 
regenerative  medicine,  materials  science  approaches  have  started  to 
play  an  important  part  in  the  biomedical  literature. 

In  addition  to  mechanical  deformation  and  failure  mechanisms 
of  protein  materials,  the  investigation  of  other  material  properties 
could  be  of  great  interest.  Optical  properties  (for  example  in 
the  eyes  cornea  and  lens),  photoelectric  properties  (for  example 
photosynthesis  in  plants),  electrical  (for  example  synapses,  the  links 
between  neuron  cells),  motility  (for  example  in  muscle  tissues)  or 
thermal  properties  (for  example  thermal  management)  are  critically 
important  and  could  be  studied  using  a  materials  science  approach. 
The  study  of  materials  failure  could  provide  an  interesting  platform 
to  advance  our  understanding  of  diseases. 
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